AV timing measurement and correction for digital television

ABSTRACT

An invention for measuring, maintaining and correcting synchronization between signals which suffer varying relative delays during transmission and/or storage is shown. The present invention teaches measuring the relative delay between a plurality of signals which have suffered differing delays due to transmission, storage or other processing. The preferred embodiment of the invention includes the use of a marker which is generated in response to a second signal and combined with a first signal in a manner which ensures that the marker will not be lost in the expected processing of the first signal. Subsequently a first delayed marker is generated in response to the marker associated with or recovered from the first signal, and a second delayed marker is generated from the second signal. The first delayed marker and second delayed marker are compared to determine a measure of the relative timing or delay between said first signal and said second signal at said subsequent time.

This application is a Division of application Ser. No. 14/522,494 filed Oct. 23, 2014 and issues as U.S. Pat. No. 9,386,192 on Jul. 5, 2016, which is a Division of application Ser. No. 14/170,786 filed Feb. 3, 2014 and issued as U.S. Pat. No. 9,071,723 on Jun. 30, 2012, which is a Division of application Ser. No. 13/347,633 filed Jan. 10, 2012 and issued as U.S. Pat. No. 8,810,659 on Aug. 19, 2014, which is a Division of application Ser. No. 12/471,127 filed May 22, 2009 and issued as U.S. Pat. No. 8,159,610 on Apr. 17, 2012, which is a Division of Ser. No. 10/894,746 filed Jul. 19, 2004 and issued as U.S. Pat. No. 7,710,499 on May 4, 2010, which is a Division of application Ser. No. 09/545,529 filed Apr. 7, 2000 and issued as U.S. Pat. No. 6,836,295 on Dec. 28, 2004, all of which are incorporated herein by reference as fully as if they had been set out in detail and to which priority is claimed. Application Ser. No. 09/545,529 is in turn a Continuation-in-part of application Ser. No. 09/119,524 filed Jul. 21, 1998 and issued as U.S. Pat. No. 6,351,281 on Feb. 26, 2002, which is a Division of application Ser. No. 08/620,126 filed Mar. 21, 1996 which issued as U.S. Pat. No. 6,330,033 on Dec. 11, 2001, which claims benefit of Provisional Application 60/008,309 filed Dec. 7, 1995. No priority to application Ser. No. 09/119,524, U.S. Pat. No. 6,351,281, application Ser. No. 08/620,126, U.S. Pat. No. 6,330,033 or Provisional Application 60/008,309 is claimed, but which are incorporated herein by reference, in respect to their prior art teachings.

The examiner's attention is called to incorrectly published U.S. Pat. No. 5,847,769 which is related to the present application by virtue of common application Ser. No. 08/620,126. The '769 patent was withdrawn from issue. Despite the fact of the patent being withdrawn from issue it was nevertheless published by the Patent Office. Applicant brings this withdrawn patent to the attention of the examiner out of applicant's duty of candor.

BACKGROUND OF THE INVENTION

The invention relates to measuring, maintaining and correcting synchronization between two signals which suffer varying relative delays during transmission and/or storage, and in particular to measuring the relative delay between multiple audio signals and an associated video signal of a television type program which is compressed via MPEG or other compression method for transmission and/or storage.

1. Field of the Invention

The present invention relates to the field of transmitting and storing multiple electronic signals where synchronization of the signals is of concern. When such transmitting and storing are of a nature which makes the corresponding receiving and recovering of said signals subject to timing errors resulting from differing amounts of processing delays the present invention is useful in measuring the relative timing errors or delays between signals with such delay measurement being used as a meter of quality of the transmitting and storing and for maintaining or correction of relative delays between such signals.

2. Description of Related Prior Art

It is known in the television signal transmission field to measure and correct audio to video timing errors by measuring the delay which a video signal experiences and using that measurement to delay a companion audio signal by a corresponding amount.

U.S. Pat. No. 4,313,135 by the present inventor shows to compare relatively undelayed and delayed versions of the same video signal to provide a delay signal responsive to the delay thereof and to couple that delay signal to a variable audio delay to cause the audio delay to delay the companion audio signal by a corresponding amount.

U.S. Pat. Nos. 4,665,431 and 5,675,388 by the present inventor show transmitting an audio signal as part of a video signal so that both the audio and video signals experience the same transmission delays thus maintaining the relative synchronization therebetween.

U.S. Reissue Pat. No. RE 33,535 corresponding to U.S. Pat. No. 4,703,355 shows in the preferred embodiment to encode in the vertical interval of a video signal, a timing signal derived from an audio signal and transmitting the combined video signal and the audio signal. At the receiving location the timing signal is recovered from the video signal and a new timing signal is generated from the received audio signal. The two timing signals are compared at the receiving location to determine the relative delay between the timing signal recovered from the video and the newly generated timing signal, thus determining the relative delay between the video and audio signals at the receive location. It is also suggested to put a timing signal in the audio signal.

U.S. Pat. No. 5,202,761 by the present inventor shows in the preferred embodiment to encode a pulse in the vertical interval of a video signal before the video signal is delayed. The encoded pulse is recovered from the vertical interval of the delayed video signal. Various methods responsive to the encoded pulse or the timing thereof for the undelayed video and the encoded pulse recovered from the vertical interval of the delayed video are shown which enable the determination of the delay, or the control of a corresponding audio delay.

U.S. Pat. No. 5,530,483 by the present inventor shows determining video delay by sampling an image of the undelayed video and sampling images, including the same image of the delayed version of the video and comparing the samples of the undelayed image to the samples of the delayed images until a match is found indicating that the undelayed image in delayed form is being compared. The time lapse between the sampling of the undelayed image, and the finding of the matching delayed image is used as a measure of video signal delay.

U.S. Pat. No. 5,572,261 by the present inventor shows a method of determining the relative delay between an audio and a video signal by inspecting the video for a speaker's mouth and determining various mouth patterns of movement which correspond to sounds which are present in the audio signal. The time relationship between a mouth pattern which creates a sound and the occurrence of that sound in the audio is used as a measure of audio to video timing.

U.S. Pat. No. 5,751,368, a CIP of U.S. Pat. No. 5,530,483 shows the use of comparing samples of relatively delayed and undelayed versions of video signal images for determining the delay of multiple signals.

Applicant incorporates all of the above prior art patents herein as fully as if they were set forth in their entirety for the purposes of enabling one of ordinary skill in the art to practice the present invention in so far as the present invention utilizes many elements which are taught therein. In particular, attention is called to U.S. Pat. No. RE 33,535 and the teachings of generating a timing signal in response to an audio signal, and the comparison of a recovered timing signal and a newly generated timing signal at the receiving site to determine the relative delay therebetween.

The above cited inventions often prove to be less than complete solutions for modern television systems and others which transmit or store a plurality of signals for various reasons including for example those problems recited below. In particular, the current transmission of MPEG compressed television signals has proven to have particular difficulty in maintaining audio to video synchronization, and the prior art has particular problems in dealing with such.

U.S. Pat. No. 4,313,135 compares relatively undelayed and delayed versions of the same video signal to provide a delay signal. This method requires connection between the undelayed site and the delayed site and is unsuitable for environments where the two sites are some distance apart. For example where television programs are sent from the network in New York to the affiliate station in Los Angeles such system is impractical because it would require the undelayed video to be sent to the delayed video site in Los Angeles without appreciable delay, somewhat of an oxymoron when the problem is that the transmission itself creates the delay which is part of the problem. A problem also occurs with large time delays such as occur with storage such as by recording since by definition the video is to be stored and the undelayed version is not available upon the subsequent playback or recall of the stored video.

U.S. Pat. Nos. 4,665,431 and 5,675,388 show transmitting an audio signal as part of a video signal so that both the audio and video signals experience the same transmission delays thus maintaining the relative synchronization therebetween. This method is expensive for multiple audio signals, and the digital version has proven difficult to implement when used in conjunction with video compression such as MPEG.

U.S. Reissue Pat. No. RE 33,535 corresponding to U.S. Pat. No. 4,703,355 shows in the preferred embodiment to encode a timing signal in the vertical interval of a video signal and transmitting the video signal with the timing signal. Unfortunately many systems strip out and fail to transmit the entire vertical interval of the video signal thus causing the timing signal to be lost. It is suggested to put a timing signal in the audio signal, which is continuous thus reducing the probability of losing the timing signal. Unfortunately it is difficult and expensive to put a timing signal in the audio signal in a manner which ensures that it will be carried with the audio signal, is easy to detect, and is inaudible to the most discerning listener.

U.S. Pat. No. 5,202,761 shows to encode a pulse in the vertical interval of a video signal before the video signal is delayed. This method also suffers when the vertical interval is lost.

U.S. Pat. No. 5,530,483 shows determining video delay by a method which includes sampling an image of the undelayed video. This method also requires the undelayed video, or at least the samples of the undelayed video, be available at the receiving location without significant delay. Like the '135 patent above this method is unsuitable for long distance transmission or time delays resulting from storage.

U.S. Pat. No. 5,572,261 shows a method of determining the relative delay between an audio and a video signal by inspecting the video for particular sound generating events such as a particular movement of a speaker's mouth and determining various mouth patterns of movement which correspond to sounds which are present in the audio signal. The time relationship between a video event such as mouth pattern which creates a sound and the occurrence of that sound in the audio is used as a measure of audio to video timing. This method requires a significant amount of audio and video signal processing to operate.

U.S. Pat. No. 5,751,368, a CIP of U.S. Pat. No. 5,530,483 shows the use of comparing samples of relatively delayed and undelayed versions of video signal images for determining the delay of multiple signals. Like the '483 patent the '368 patent needs for the undelayed video or at least samples thereof to be present at the receiving location.

U.S. Pat. No. 6,330,033 and Division U.S. Pat. No. 6,351,281 show a delay tracker for a signal processing system, where the delay tracker utilizes a special code or pulse associated with the tracked signal with the system including a pulse detector later recognizing the special code or pulse in order to identify such signal and ascertain any delays associated with the signal including possible resynchronization of associated signals. In the preferred embodiment the invention is utilized with video and audio signals to measure or maintain lip sync. The delay tracker is associated with the video signal in a manner that it will be carried through the processing that it is expected to receive. In one particular example the tracker which is associated with the video signal is generated in response to certain artifacts or characteristics already present in the audio signal.

The instant invention provides for improvements in the field of transmitting and storing multiple electronic signals where synchronization of the signals is of concern, for example related to U.S. Pat. Nos. 6,330,033 and 6,351,281.

Attempts have been made to add various timing related signals in television program streams in order to maintain audio to video synchronization. In particular in MPEG systems control signals such as time stamps are utilized. Unfortunately the inclusion of these signals does not guarantee proper audio to video synchronization at the receive side output of the system for a variety of reasons, including the fact that there are significant video delays which occur which cannot be tracked by the time stamps.

BRIEF SUMMARY OF THE INVENTION

It is an object of the invention to provide a method for measuring or maintaining the relative delay of a plurality of signals which are passed through subsequent processing.

It is another object of the invention to provide a method of generating a marker in response to a second signal which marker may be associated with a first signal in a fashion that said marker is carried with said first signal through processing of said first signal.

It is still another object of the invention to provide a method of responding to a marker which has been associated with a first signal and a marker which is provided in response to a second signal whereby said markers may be utilized to determine the relative delay between said first and second signals.

It is a further object of the invention to provide a marker in response to a signal wherein said marker indicates the occurrence of particular characteristics of said signal.

It is a still further object of the invention to provide a system of measuring the relative delay between an audio and a video signal in a television system wherein the audio and video signals are subject to differing processing which creates unequal delays in said signals.

It is yet still a further object of the invention to provide a method of marking a first signal which may be a video signal to allow relative delay measurement of said first signal and a second signal which may be an audio signal after they have been processed, including use of a marker generator responsive to the second signal to generate a marker upon the occurrence of one or more particular characteristics of the audio, associating the marker with the video signal in a fashion such that the marker will be carried with the video signal and not be adversely affected by the subsequent processing thereof.

It is yet still another object of the invention to provide a relative delay measurement system for measuring the relative delay between a plurality of signals including a first signal which is a video signal and second signal which is an audio signal which signals experience unequal delays due to processing thereof, the invention including use of a marker generator responsive to the audio signal to generate a marker upon the occurrence of one or more particular characteristics of the audio, associating the marker with the video signal in a fashion such that the marker will be carried with the video signal but not be adversely affected by the subsequent processing thereof, responding to the marker with the video signal after the processing to generate a first delayed marker; generating a second delayed marker in response to the processed audio signal, comparing the relative timing of the first and second delayed markers to determine the relative timing between the processed audio and processed video signal.

The preferred embodiment of the invention may be used with a television signal. At the transmitting location a marker is generated in response to the audio signal and is associated with the video signal such that the marker is carried with the video signal in a fashion such that it will not be lost or adversely affected by the expected processing of the video signal. The audio signal and the marker associated video signal are stored, transmitted and/or processed and made available at a later time thus becoming delayed video and audio signals. A first delayed marker is recovered from the delayed video signal and a corresponding second delayed marker is generated from the delayed audio signal, with the two delayed markers compared to determine the relative delay therebetween. This relative delay between these markers is responsive to and is a measure of the delay between the delayed video signal and delayed audio signal.

Somewhat simplistically stated, the preferred embodiment of the invention operates by generation of the marker at the transmit section, which may be thought of a marking the video at the time of the occurrence of a known event in the audio signal. The time marker is associated with the video signal such that it is carried in time with the video signal for all of the processing which the video signal is to experience. After the video signal processing and any audio signal processing, the same event in the audio is again marked in time, and the previously marked time (relative to the video) is recovered or flagged in the received video. Since it is known that the audio event and the marking of the video occurred (substantially) simultaneously at the transmit location, the displacement between those events at the receive location is a measure of the audio to video timing error, or the relative delay therebetween.

Generally, the present invention teaches measuring the relative delay between a plurality of signals which have suffered differing delays due to transmission, storage or other processing. The preferred embodiment of the invention includes the use of a marker which is generated in response to a second signal and combined with a first signal in a manner which ensures that the marker will not be lost in the expected processing of the first signal. Subsequently a first delayed marker is generated in response to the marker associated with or recovered from the first signal, and a second delayed marker is generated from the second signal. The first delayed marker and second delayed marker are compared to determine a measure of the relative timing or delay between said first signal and said second signal at said subsequent time.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a block diagram of the preferred embodiment of the invention as used with a television audio and video signal.

FIG. 2 shows a block diagram of the marker generator 3 and 13 of the preferred embodiment of the invention.

DETAILED DESCRIPTION OF THE INVENTION

In FIG. 1 the preferred embodiment of the invention which is given by way of example, a video signal 1 and an audio signal 2 are present at what will be referred to as the transmit location. Either or both the video and audio signals may be in analog or digital, compressed or uncompressed form, the many variations and versions of which are well known in the art. Further. while the preferred embodiment is shown in respect to one video and one audio signal, it will be appreciated from the teachings herein that the invention may be utilized and practiced with multiple video and/or audio signals. In particular, by way of example the invention may be practiced with video and stereo (2 channel), surround (4+channel) or 5.1 channel audio systems as are contemplated for the new U.S. digital and HDTV transmission standards. It is also noted that the components of the invention may be implemented by analog, digital or software means or combinations thereof.

A marker generator 3 is responsive to the audio signal, and may be responsive to the video signal as indicated by the dashed line. In response to detecting the occurrence of one or more particular feature or characteristic of the audio signal generates a marker. One of ordinary skill in the art will recognize that element 44 of U.S. Pat. No. RE 33,535 may be utilized as element 3 herein. Other constructions and operations of 3 will also be known to one of ordinary skill from the present teachings. The particular features, characteristics, occurrences or other event in the audio signal which will result in the marker, will be referred to hereinafter as occurrences and the marker in its various forms will sometimes be referred to simply as a marker, one of ordinary skill understanding from the context and the teachings herein the specificity of the form or forms being referred to.

The marker from 3 is associated with the video signal 1 in a marker associator 4. One of ordinary skill in the art will recognize that element 10 of the U.S. Pat. No. 6,330,033 specification can be used for element 4 herein. Other constructions and operations of 4 will also be known to one of ordinary skill from the present teachings. The marker is preferred to be associated with the video signal in a fashion that the marker will not be lost, corrupted or modified beyond use by subsequent processing of the video signal. In particular it is preferred to associate the marker with the video signal by including the marker within the active picture information of the video signal in one of the manners disclosed in detail in the U.S. Pat. No. 6,330,033 specification. Consequently the marker may take on a form of active video, whatever form the video may be in.

Alternatively, the marker may be associated with the video signal by being encoded in the active video in a relatively invisible fashion by utilizing one of the various watermark techniques which are well known in the art. Watermarking is well known as a method of encoding the ownership or source of images in the image itself in an invisible, yet recoverable fashion. In particular known watermarking techniques allow the watermark to be recovered after the image has suffered severe processing of many different types. Such watermarking allows reliable and secure recovery of the marker after significant subsequent processing of the active portion of the video signal. By way of example, the marker of the present invention may be added to the watermark, or replace a portion or the entirety of the watermark, or the watermarking technique simply adapted for use with the marker. It is believed that this use of watermarking techniques to associate marker signals with video signals for audio to video timing purposes is novel and previously unknown to those in the art. Other methods of associating the marker with the video signal will be known to those of ordinary skill in the art from the teachings herein.

The video signal with the marker is output from 4 and coupled to the video encoder 5. The video encoder 5 is used by way of example in the present description to represent that part of the subsequent video processing which may take place at the transmitting side of the system. For example, the video encoder may include MPEG preprocessing and compression circuits. Similarly, the audio 2 is coupled to an audio encoder 6 which is used by way of example in the present description to represent the audio processing which may take place at the transmitting side of the system. For example, the audio encoder may include an MPEG compression circuit. The compressed video and audio signals are combined by video and audio combiner 7 and the combined signals are coupled to the transmission channel 8.

The audio and video signals from the transmission channel 8 are coupled to a video and audio separator 9 which separates the audio and video signal components of the transmitted signal(s). The audio and video signals are coupled to audio decoder 11 and video decoder 10 respectively, where they are decoded back into decoded audio 17 and decoded video 16 respectively.

At the receiving side, marker separator 12 responds to the marker which was previously combined in the video signal by 4 to provide a first delayed marker to 14. The first delayed marker may be in the same form or different form as the marker which is associated with the video. It is preferred that the marker be recovered from the video and provided as the first delayed marker, however it is sufficient to merely detect the presence of the marker in the video and generate a first delayed marker in response thereto. One of ordinary skill in the art will recognize that element 40 of the U.S. Pat. No. 6,330,033 specification may be utilized for element 12 herein. Other constructions and operations of 12 will also be known to one of ordinary skill from the present teachings.

Also at the receiving side, another marker generator 13, similar to 3, generates a second delayed marker in response to the same audio signal occurrences in the receive section audio from 11 as did the marker generator 3 on the transmit section in response to audio signal 2. Marker generator 13 may also be responsive to video in a fashion as previously described for 3 as shown by 19. The second delayed marker generated by 13 need not be in the same form as the marker generated by 3, but is preferred to be in the same form as the first delayed marker provided by 12.

The first and second delayed markers from 12 and 13 are coupled to the relative timing comparison 14. The relative timing comparison is responsive to these delayed markers to determine the timing between corresponding pairs thereof to determine the relative timing between them. In other words the relative timing comparison 14 determines the delay 15 of the later of the two delayed markers relative to the earlier, indicating both the magnitude of the delay and which signal is more delayed. One of ordinary skill in the art will recognize from the teachings herein that relative timing comparison 14 may operate as described with respect to element 50 of the U.S. Pat. No. 6,330,033 specification. Other constructions and operations of 14 will also be known to one of ordinary skill from the present teachings.

Since the first delayed marker from 12 experiences the delay of the video signal 1, and the second delayed marker from 13 experiences the delay of the audio signal 2 in their respective paths from the input of the transmit section to the output of the receive section, signal 15 is a measure of the relative delay of audio 17 and video 16 at the output of the receive section.

The relative delay 15 may be utilized for all of the uses and reasons set forth in the U.S. Pat. No. 6,330,033 specification. In particular note that the relative delay signal 15 is useful in itself as a measure of system quality. Relative delay signal 15 may be utilized to control a delay to delay the earlier of 16 or 17 to place the two signals into synchronization. Relative delay signal 15 may also be utilized to control a delay which is incorporated into 10 or 11 or both (or elsewhere in the system) to control the delay of the earlier of the audio or video from 9 to maintain the two signals 16 and 17 in synchronization. Relative delay signal 15 may also be utilized for other purposes, for example as feedback to control the operation of encoder 5 or 6 or decoder 10 or 11 to minimize or otherwise optimize delay or encoding and decoding of audio or video.

Various different embodiments of the invention herein described will be apparent to one of ordinary skill in the art from the teachings herein. As an example, the marker generator 3 may be responsive to the video signal as shown by 18 in order to relate the marker to the video signal, for example to properly locate the marker for combination with the video signal or to relate the particular feature(s) of the audio signal to timing of the video signal. In particular, it is desired that the marker represent whether or not the particular features occurred in the audio signal during the one or more frame or field immediately prior to the marker being combined with the video and going back to the time when the immediately previous marker was combined.

The marker is preferred to be a binary signal which indicates that one or more of a number of particular occurrences of the audio which has taken place during the preceding field(s), or is currently taking place. For example, an 8 bit binary signal may be utilized with different numbers corresponding to different occurrences or features. In the preferred embodiment, it is preferred that the audio signal, which in the present example is assumed to have a bandwidth of 10 Hz to 20,000 Hz be broken up into 8 different frequency bands by bandpass filtering. Each bit of the 8 bit number corresponds to the presence of audio frequencies within a particular band having energy within known levels and for known time durations. For example, if no such frequencies are present, the binary number 0 (0000 0000) results. If the lowest frequencies occur, the binary number 1 (0000 0001) results. If the next highest frequency occurs the binary number 2 (0000 0010) results. If both the lowest and next highest occur a 3 results. If all frequencies occur the binary number 255 (1111 1111) results. The binary number is the marker which is combined with the video.

It is important to note that by associating the marker with the video signal in the fashion of including it in the active video portion of the signal that the marker will not be lost when all of the sync and blanking (or line, field and other ancillary signals if in digital form) are removed from the video signal such as is done as part of the MPEG encoding process. The association of the marker directly with the image carried by the video signal essentially guarantees that no matter what processing, stripping or modification of ancillary portions of the video signal occurs, in either analog or digital form, or conversion of scanning rates, or adjustment of usual video parameters such as black, brightness and chroma, that the marker will still be detectable at the receive location.

The transmission channel 8 is utilized in the present example to represent any common or independent use or processing of the video signal 1 and audio signal 2 which may cause or result in unequal delays which lead to timing difficulties. Examples of such uses include transmission, storage and further processing, and in particular include storage and/or transmitting of MPEG encoded audio and video signals.

Also it may be noted that marker generators 3 and 13 may respond to video in other forms, or from other parts of the system, or may respond to other signals, for example a genlock reference, in order to achieve proper operation and timing of the marker generator.

It may be noted that the use of the video encoder 5, audio encoder 6 and video and audio combiner 7 is given by way of example, as is usual for MPEG compression and transmission systems which are commonly used in today's television systems. The invention is not limited to the use of such elements however and one of ordinary skill in the art will know how to practice the generation of the marker and the associating of the marker with the video signal in other systems from the present teachings. The combined marker and video signal from 4 and the audio signal 2 may very well be utilized in practicing the present invention without the added elements 5-7.

It will be understood that in the present example the elements 9, 10 and 11 are the receiving side elements complimentary to corresponding transmitting side elements 7, 5 and 6 respectively. As with 5, 6 and 7, elements 9, 10 and 11 are not required to practice the invention. In particular, video from 4 may be coupled, via a transmission channel directly to element 12 and become video signal 16. Similarly, audio signal 2 may be coupled via the same or different transmission channel directly to 13 and become audio signal 17.

In the situation where the transmission channel includes storage of the audio and video signals, and storage and recovery is not performed simultaneously, it is noted that a single marker generator 3 may perform the function of 3 upon the storing of the signals and subsequently perform the function of 13 upon the recovery of the stored signals. Other sharing of circuitry between storing and recovery functions may also be had given the assumption that both are not performed simultaneously.

FIG. 2 shows the preferred form of the marker generator 3 and 13 of the preferred embodiment of the invention as used with television audio signals. Audio signal 20 which may correspond to 2 or the output of 11 in FIG. 1 is coupled to a bank of 8 bandpass filters 21 a-h which are configured to pass only audio within a range of frequencies as is well known in the art. The output of each bandpass filter is coupled to a comparator 22 a-h respectively. The comparators include hysteresis or other threshold(s) and bipolar response characteristic so that if the positive or negative half cycle of bandpassed audio out of the bandpass filter exceeds a threshold amount set by the hysteresis, the output of the comparator is activated. Each comparator output is respectively coupled to a timing duration circuit 23 a-h. Each timing duration circuit also receives a reset signal from the timing circuit 26. The timing circuit 26 provides signals to the parallel to serial converter 24 in addition to the reset signal provided to the timing duration circuits 23. Once the timing duration circuit is reset, it inspects the output signal from its respective comparator 22. If the output signal from 22 is activated for an established time duration indicating the presence of audio frequencies within the corresponding bandpass filter range, the timing duration circuit sets its output active and holds it active until the next reset signal. The outputs of all of the timing duration circuits 23 are simultaneously latched into the parallel to serial circuit 24 upon command from the timing circuit 26 and shortly thereafter the reset signal to 23 is generated. Also shortly after latching, the bits latched into 24 are caused to be output in serial fashion as marker 25. The net effect of the circuitry is to set a bit of the timing signal active corresponding to each of the bandpass audio frequencies which was present during the time period from one reset signal to the next, which corresponds to the time period from the generation of one marker to the next. The timing circuit 26 is responsive to the video signal to set the desired time period between markers, as well as to time the output of the marker 25 so that it is associated with the video signal at the correct time. This action will ensure that the marker is placed at the desired position in the video signal.

The bandpass filters are preferred to be selected to provide frequent outputs with the expected types of audio signals. For commercial television audio signals it has been found that bandpass filters with center frequencies of 25, 50, 150, 400, 1000, 2500, 6000, 15000 Hz and skirts of 6 dB per octave work well. Other center frequencies and bandwidths may be chosen, and the number of filters changed, to facilitate expected audio signal frequency content. Ideally the frequencies would be chosen such that the lowest frequency filter has an output which is active or makes a change of state only once per period of the maximum expected delay differential of the audio and video signal. Alternatively, other audio characteristics may be relied on in the place of, or in addition to, the detection of energy at particular frequencies as described in respect to the preferred embodiment. Examples include, but are not limited to, impulse characteristics, amplitude characteristics, relationships between different frequency energies, relationships among and between different audio channels.

Another example of alternate audio characteristics which may be utilized for the marker is the particular audio sonic characteristics which are relied on for the audio compression. Because these characteristics are already detected in the compression circuitry the present invention may share circuitry thus resulting in lowered cost. Other sharing of circuitry with other functions may be possible depending on the particular signals and environment with which the invention is used.

While it has been described to utilize the marker generator with one audio signal in the preferred embodiment, it will be understood that multiple audio signals may be accommodated, with each having a corresponding marker which is associated with the video. Alternatively a plurality of audio signals may be used to generate a lesser number or even one marker by various techniques which include combining the plurality of audio signals before coupling to the marker generator, or by combining various markers each responsive to one or a small number of audio signals with the various markers being combined into a smaller number or a single master marker.

It may be noted that many audio ICs which are used for audio graphic equalizer functions contain bandpass filters which may be adapted to use in this invention. Of course it is possible to implement the various elements of the marker generator, as well as the rest of the invention, in analog or digital hardware, or software/hardware or combinations thereof.

It will be noted that the present description of the preferred embodiment of the invention is given by way of example. In particular the diagrams of the preferred embodiment are presented as block diagrams and do not show in detail circuitry and cooperation which would be known to those of ordinary skill in the art from the teachings herein without undue experimentation. By way of example it is noted that where one signal line is shown in the block diagram that multiple signals may in actuality be coupled between one block and another, and although separate functional blocks are shown it will be known to make different combinations, arrangements and implementations in order to share elements therebetween and reduce costs. It is also noted that various terms used in the specification, including generator, combiner, encoder, separator, decoder and comparison, and their various tenses are intended to have broader meaning than that ordinarily ascribed thereto with respect to circuit elements, and are intended to cover not only the commonly understood element but the equivalent operation or function as implemented by other circuitry or software/hardware combinations. One of ordinary skill in the art will know to resort to various changes and modifications to the invention as described as well the combination of the invention with other features functions and/or inventive concepts in order to accommodate the use of the invention with particular forms of signals and otherwise to practice the invention in a fashion which is optimized for particular application without departing from the spirit and scope of the invention as hereafter claimed. 

The invention claimed is:
 1. An apparatus for generating a first set of markers in response to a relatively undelayed audio signal having one or a plurality of channels which is part of a digital television program having video comprised of frames and known number of audio samples corresponding to each frame according to one or more digital television standard which first set of markers are intended to be utilized with a second set of markers generated in response to a delayed version of the audio signal to determine delays of audio relative to video portions of the digital television program, the apparatus including: a) an input circuit responsive to one channel or a combined plurality of channels of the digital audio portion of a digital television program to provide a first digital output portion capable of carrying the audio energy present in said channel or said combined plurality of channels over a first range of audio frequencies; b) an input filter circuit comprising a digital filter responsive to said first digital output portion to pass energy present therein which is within a second range of audio frequencies less than said first range of audio frequencies to provide a digital filter output, said input filter circuit operating to provide a first series of binary bits when said digital filter output is activated indicating the presence of energy at audio frequencies passed by said digital filter thus generating a set of binary bits per frame of video; c) a marker generator circuit responsive to said first series of binary bits to provide a series of master markers for each frame of video which is a smaller series of binary bits relative to said first series of binary bits, said marker generator further responsive to the digital video portion of said digital television program to relate the timing of said master markers to the timing of said digital video portion.
 2. The apparatus of claim 1 wherein in b) said input filter circuit provides a first series of binary bits as said digital filter output is activated for an established time duration indicating the presence of audio frequency energy passed by said digital filter.
 3. The apparatus of claim 1 wherein in b) said digital filter is a bandpass filter having a passband frequency range less than said first range of audio frequencies.
 4. The apparatus of claim 1 wherein in b) said input filter circuit comprises a plurality of digital bandpass filters, each having a bandpass characteristic different from the other digital bandpass filter(s) and each having a passband less than said first range of audio frequencies.
 5. The apparatus of claim 1 wherein in b) said input filter circuit includes only one digital bandpass filter having a bandpass characteristic which passes less than all of said first range of audio frequencies.
 6. The apparatus of claim 1 wherein in b) said input filter circuit includes passing energy at low audio frequencies in the range including 25 Hz, 50 Hz and 150 Hz and attenuating energy above 400 Hz.
 7. The apparatus of claim 1 wherein in b) said input filter circuit includes a plurality of digital filters with said first series of binary bits responsive to the outputs of said plurality of digital filters.
 8. A marker generator apparatus for use in a digital television system having digital television programs with video comprising frames and audio comprising a known number of samples per frame defined by one or more digital television standard the apparatus generating a first set of markers in response to a relatively undelayed or delayed plurality of channels of audio, the first set of markers are intended to be related to the video of the digital television program and to be utilized with a second set of markers generated in response to a delayed or undelayed version respectively of the audio, the marker generator apparatus including: a) an input circuit responsive to a digital audio portion of a digital television program to provide a first digital output portion capable of carrying energy in a first range of audio frequencies; b) an input filter circuit comprised of one or more digital bandpass filter responsive to said first digital output portion and operating to pass particular features which include energy present therein within a second range of audio frequencies less than said first range of audio frequencies to generate a digital filter output and in response to said digital filter output generating a set of binary bits for each frame of video of said digital television program; c) a marker generator circuit responsive to each said set of binary bits of b) to provide a corresponding master marker which is a smaller set of binary bits than said corresponding set of b), said marker generator further responsive to the digital video portion of said digital television program to relate the timing of said master markers to the timing of said digital video portion; d) a relative timing comparison circuit responsive to said master markers of c) and a corresponding set of master markers which were either generated before said digital video portion was delayed, or generated after said digital video portion was delayed, and operative to determine the timing relation between said master markers of c) and said corresponding set of markers and to output a relative delay signal indicating the temporal timing relationship, after delay of one or both, of the digital audio portion and digital video portion of said digital television program.
 9. An apparatus as in claim 8 where said digital audio portion of a) is made up of multiple channels of digital audio which are combined to provide said first digital output audio portion.
 10. An apparatus as in claim 8 where said marker generator circuit of c) is responsive to said digital video portion to associate said particular features of b) of said digital audio portion of a) represented by said master markers with one or more frame or field of said digital video portion.
 11. An apparatus as in claim 10 wherein when said digital audio portion of a) and said digital video portion of c) are located in relatively undelayed form at a first television system location, said apparatus further including a second marker generator apparatus substantially the same as said marker generator apparatus and located at a second system location responds to said digital audio portion in delayed form and said digital video portion in delayed form to provide said corresponding set of master markers.
 12. An apparatus as in claim 8 wherein element b) includes a second digital bandpass filter, responsive to said first digital output audio portion and operating to pass second particular features which are energy present therein within a third range of audio frequencies different than said second range of audio frequencies and less than said first range of audio frequencies and to generate a second digital filter output and in response to said second digital filter output generating a respective third series of binary bits.
 13. An apparatus as in claim 8 wherein in element b) said second range of audio frequencies includes passing energy at low audio frequencies in the range including 25 Hz, 50 Hz and 150 Hz and attenuating audio frequencies above 400 Hz.
 14. An apparatus as in claim 8 wherein in element b) said one digital bandpass filter has a center frequency selected from one of 25, 50, 150, 400, 1000, 2500, 6000 or 15000 Hz.
 15. An apparatus for use with digital television programs initially having relatively undelayed video portions comprising frames of video having a corresponding time duration and corresponding audio portions which are related to the video as defined by one or more digital television standard, the apparatus generating a first set of markers in response to the relatively undelayed audio portion having one or a plurality of channels, the first set of markers intended to be utilized with a second set of markers generated in response to a respectively delayed version of the audio signal, with the two sets of markers being utilized to determine the relative timing of delayed audio and delayed video portions, the apparatus including: a) an input circuit responsive to one channel or a combined plurality of channels of the relatively undelayed digital audio portion of a digital television program and providing a first digital audio output capable of carrying the audio energy present in said one channel or said combined plurality of channels respectively over a first range of audio frequencies; b) an input digital filter circuit including at least one digital filter responsive to said first digital audio output to pass energy present therein within a second range of frequencies less than said first range of audio frequencies to provide a digital filter output, said input digital filter circuit further operating in response to said digital filter output to provide first sets of single bit markers indicating the presence of energy within said second range of audio frequencies which energy is passed by said digital filter during an amount of said digital audio portion corresponding to a video frame; c) a marker generator circuit responsive to each set of said first sets of single bit markers to provide a set of master markers which is smaller than the corresponding first set of markers thus providing a series of sets of master markers with each set of master markers responsive to energy passed by said digital filter during an amount of said relatively undelayed digital audio portion corresponding to a video frame; d) a second input circuit substantially similar to a) responsive to a relatively delayed version of said digital audio portion of said digital television program to provide a second digital audio output; e) a second input filter circuit substantially similar to b) responsive to said second digital audio output of d) to provide second sets of markers; f) a second marker generator circuit substantially similar to c) responsive to said second sets of markers of e) to provide second sets of master markers; g) a relative timing comparison circuit responsive to said sets of master markers of c) corresponding to said relatively undelayed digital audio portion of said digital television program and to said second sets of master markers of f) corresponding to said relatively delayed version of said digital audio portion of said digital television program, with said relative timing comparison circuit providing a measure of the relative delay of said delayed version of said digital audio portion of said digital television program and the delayed version of the digital video portion of said digital television program.
 16. The apparatus of claim 15 wherein when said undelayed digital audio portion of a digital television program of a) subsequently becomes said relatively delayed digital audio portion of said digital television program of d) said second digital audio output of d) is substantially the same as a delayed version of said digital audio output of a), said second sets of markers of e) are provided in the same manner as the first sets of markers of b), and said second sets of master markers of f) is provided in the same manner as said sets of master markers of c).
 17. The apparatus of claim 15 wherein in b) said input digital filter circuit incorporates a plurality of bandpass filters with each responsive to said first digital audio output to pass energy present therein which is within a range of frequencies different than the other said bandpass filter(s) and less than said first range of audio frequencies to provide a digital filter output.
 18. The apparatus of claim 15 wherein in b) said input digital filter circuit incorporates a single bandpass filter responsive to said first digital audio output to pass energy present therein within a range of frequencies less than said first range of audio frequencies to provide a digital filter output.
 19. The apparatus of claim 15 further including a circuit to maintain and correct timing errors between said delayed version of said digital audio portion of said digital television program and said delayed version of digital video portion of said digital television program.
 20. The apparatus of claim 15 wherein said input digital filter circuit incorporates a bandpass filter responsive to said first digital audio output to pass energy present at frequencies below 400 Hz and attenuate energy present above 400 Hz by 6 Db per octave or more as the frequency of the energy increases. 